Conic section

Types of conic sections:
1. Parabola
2. Circle and ellipse
3. Hyperbola
Table of conics, Cyclopaedia, 1728

In mathematics, a conic section (or just conic) is a curve obtained by intersecting a cone (more precisely, a right circular conical surface) with a plane. In analytic geometry, a conic may be defined as a plane algebraic curve of degree 2. It can be defined as the locus of points whose distances are in a fixed ratio to some point, called the focus, and some line, called the directrix.

The conic sections were named and studied as long ago as 200 BC, when Apollonius of Perga undertook a systematic study of their properties.

Contents

History

Menaechmus

It is believed that the first definition of a conic section is due to Menaechmus. This work does not survive, however, and is only known through secondary accounts. The definition used at that time differs from the one commonly used today in that it requires the plane cutting the cone to be perpendicular to the line that generates the cone as a surface of revolution. Thus the shape of the conic is determined by the angle formed at the vertex of the cone; If the angle is acute then the conic is an ellipse, if the angle is right then the conic is a parabola, and if the angle is obtuse then the conic is a hyperbola. Note that the circle cannot be defined this way and was not considered a conic at this time.

Euclid is said to have written four books on conics but these were lost as well. Archimedes is known to have studied conics, having determined the area bounded by a parabola and an ellipse. The only part of this work to survive is a book on the solids of revolution of conics.

Apollonius of Perga

The greatest progress in the study of conics by the ancient Greeks is due to Apollonius of Perga, whose eight volume Conic Sections summarized the existing knowledge at the time and greatly extended it. Apollonius's major innovation was to characterize a conic using properties within the plane and intrinsic to the curve; this greatly simplified analysis. With this tool, it was now possible to show that any plane cutting the cone, regardless of its angle, will produce a conic according to the earlier definition, leading to the definition commonly used today.

Pappus is credited with discovering importance of the concept of a focus of a conic, and the discovery of the related concept of a directrix.

Omar Khayyám

Apollonius's work was translated into Arabic (the technical language of the time) and much of his work only survives through the Arabic version. Persians found applications to the theory; the most notable of these was the Persian mathematician and poet Omar Khayyám who used conic sections to solve algebraic equations.

Europe

Johann Kepler extended the theory of conics through the "principle of continuity", a precursor to the concept of limits. Girard Desargues and Blaise Pascal developed a theory of conics using an early form of projective geometry and this helped to provide impetus for the study of this new field. In particular, Pascal discovered a theorem known as the hexagrammum mysticum from which many other properties of conics can be deduced. Meanwhile, René Descartes applied his newly discovered Analytic geometry to the study of conics. This had the effect of reducing the geometrical problems of conics to problems in algebra.

Introduction

Conics are of three types: parabolae (1), ellipses, including circles (2), or hyperbolae (3).

The three types of conics are the ellipse, parabola, and hyperbola. The circle can be considered as a fourth type (as it was by Apollonius) or as a kind of ellipse. The circle and the ellipse arise when the intersection of cone and plane is a closed curve. The circle is obtained when the cutting plane is parallel to the plane of the generating circle of the cone – for a right cone as in the picture at the top of the page this means that the cutting plane is perpendicular to the symmetry axis of the cone. If the cutting plane is parallel to exactly one generating line of the cone, then the conic is unbounded and is called a parabola. In the remaining case, the figure is a hyperbola. In this case, the plane will intersect both halves (nappes) of the cone, producing two separate unbounded curves.

Various parameters are associated with a conic section:

conic section equation eccentricity (e) linear eccentricity (c) semi-latus rectum () focal parameter (p)
circle x^2+y^2=a^2 \,  0 \,  0 \,  a \,  \infty
ellipse \frac{x^2}{a^2}+\frac{y^2}{b^2}=1 \frac{\sqrt{a^2-b^2}}{a} \sqrt{a^2-b^2} \frac{b^2}{a} \frac{b^2}{\sqrt{a^2-b^2}}
parabola y^2=4ax \,  1 \,  a \,  2a \,  2a \,
hyperbola \frac{x^2}{a^2}-\frac{y^2}{b^2}=1 \frac{\sqrt{a^2+b^2}}{a} \sqrt{a^2+b^2} \frac{b^2}{a} \frac{b^2}{\sqrt{a^2+b^2}}
Conic parameters in the case of an ellipse

Conic sections are exactly those curves that, for a point F, a line L not containing F and a non-negative number e, are the locus of points whose distance to F equals e times their distance to L. F is called the focus, L the directrix, and e the eccentricity.

The linear eccentricity (c) is the distance between the center and the focus (or one of the two foci).

The latus rectum (2) is the chord parallel to the directrix and passing through the focus (or one of the two foci).

The semi-latus rectum () is half the latus rectum.

The focal parameter (p) is the distance from the focus (or one of the two foci) to the directrix.

The following relations hold:

Properties

Just as two (distinct) points determine a line, five points determine a conic. Formally, given any five points in the plane in general linear position, meaning no three collinear, there is a unique conic passing through them, which will be non-degenerate; this is true over both the affine plane and projective plane. Indeed, given any five points there is a conic passing through them, but if three of the points are collinear the conic will be degenerate (reducible, because it contains a line), and may not be unique; see further discussion.

Irreducible conic sections are always "smooth". More precisely, they never contain any inflection points. This is important for many applications, such as aerodynamics, where a smooth surface is required to ensure laminar flow and to prevent turbulence.

Intersection at infinity

An algebro-geometrically intrinsic form of this classification is by the intersection of the conic with the line at infinity, which gives further insight into their geometry:

Degenerate cases

There are five degenerate cases: three in which the plane passes through apex of the cone, and three that arise when the cone itself degenerates to a cylinder (a doubled line can occur in both cases).

When the plane passes through the apex, the resulting conic is always degenerate, and is either: a point (when the angle between the plane and the axis of the cone is larger than tangential); a straight line (when the plane is tangential to the surface of the cone); or a pair of intersecting lines (when the angle is smaller than the tangential). These correspond respectively to degeneration of an ellipse, parabola, and a hyperbola, which are characterized in the same way by angle. The straight line is more precisely a double line (a line with multiplicity 2) because the plane is tangent to the cone, and thus the intersection should be counted twice.

Where the cone is a cylinder, i.e. with the vertex at infinity, cylindric sections are obtained;[1] this corresponds to the apex being at infinity. Cylindrical sections are ellipses (or circles), unless the plane is vertical (which corresponds to passing through the apex at infinity), in which case three degenerate cases occur: two parallel lines, known as a ribbon (corresponding to an ellipse with one axis infinite and the other axis real and non-zero, the distance between the lines), a double line (an ellipse with one infinite axis and one axis zero), and no intersection (an ellipse with one infinite axis and the other axis imaginary).

Eccentricity, focus and directrix

Ellipse (e=1/2), parabola (e=1) and hyperbola (e=2) with fixed focus F and directrix (e=∞).

The four defining conditions above can be combined into one condition that depends on a fixed point F (the focus), a line L (the directrix) not containing F and a nonnegative real number e\ (the eccentricity). The corresponding conic section consists of the locus of all points whose distance to F equals e\ times their distance to L. For 0 < e < 1 we obtain an ellipse, for e = 1 a parabola, and for e > 1 a hyperbola.

For an ellipse and a hyperbola, two focus-directrix combinations can be taken, each giving the same full ellipse or hyperbola. The distance from the center to the directrix is a/e, where a \ is the semi-major axis of the ellipse, or the distance from the center to the tops of the hyperbola. The distance from the center to a focus is a\,e \ .

In the case of a circle, the eccentricity  e = 0, and one can imagine the directrix to be infinitely far removed from the center. However, the statement that the circle consists of all points whose distance to F is e times the distance to L is not useful, because we get zero times infinity.

The eccentricity of a conic section is thus a measure of how far it deviates from being circular.

For a given a\ , the closer e\ is to 1, the smaller is the semi-minor axis.

Generalizations

Conics may be defined over other fields, and may also be classified in the projective plane rather than in the affine plane.

Over the complex numbers ellipses and hyperbolas are not distinct, since there is no meaningful difference between 1 and −1; precisely, the ellipse x^2+y^2=1 becomes a hyperbola under the substitution y=iw, geometrically a complex rotation, yielding x^2-w^2=1 – a hyperbola is simply an ellipse with an imaginary axis length. Thus there is a 2-way classification: ellipse/hyperbola and parabola. Geometrically, this corresponds to intersecting the line at infinity in either 2 distinct points (corresponding to two asymptotes) or in 1 double point (corresponding to the axis of a parabola), and thus the real hyperbola is a more suggestive image for the complex ellipse/hyperbola, as it also has 2 (real) intersections with the line at infinity.

In projective space, over either the reals or complex numbers, all non-degenerate conics are equivalent, and thus in projective geometry one simply speaks of "a conic" without specifying a type, as type is not meaningful. Geometrically, the line at infinity is no longer special (distinguished), so while some conics intersect the line at infinity differently, this can be changed by a projective transform – pulling an ellipse out to infinity or pushing a parabola off infinity to an ellipse or a hyperbola.

In other areas of mathematics

The classification into elliptic, parabolic, and hyperbolic is pervasive in mathematics, and often divides a field into sharply distinct subfields. The classification mostly arises due to the presence of a quadratic form (in two variables this corresponds to the associated discriminant), but can also correspond to eccentricity.

Quadratic form classifications:

quadratic forms
Quadratic forms over the reals are classified by Sylvester's law of inertia, namely by their positive index, zero index, and negative index: a quadratic form in n variables can be converted to a diagonal form, as x_1^2+x_2^2+\dots+x_k^2-x_{k+1}^2-\dots-x_{k+l}^2, where the number of +1 coefficients, k, is the positive index, the number of −1 coefficients, l, is the negative index, and the remaining variables are the zero index m, so k+l+m=n. In two variables the non-zero quadratic forms are classified as:
In two variables quadratic forms are classified by discriminant, analogously to conics, but in higher dimensions the more useful classification is as definite, (all positive or all negative), degenerate, (some zeros), or indefinite (mix of positive and negative but no zeros). This classification underlies many that follow.
curvature
The Gaussian curvature of a surface describes the infinitesimal geometry, and may at each point be either positive – elliptic geometry, zero – Euclidean geometry (flat, parabola), or negative – hyperbolic geometry; infinitesimally, to second order the surface looks like the graph of x^2+y^2,, x^2 (or 0), or x^2-y^2.. Indeed, by the uniformization theorem every surface can be taken to be globally (at every point) positively curved, flat, or negatively curved. In higher dimensions the Riemann curvature tensor is a more complicated object, but manifolds with constant sectional curvature are interesting objects of study, and have strikingly different properties, as discussed at sectional curvature.
Second order PDEs
Partial differential equations (PDEs) of second order are classified at each point as elliptic, parabolic, or hyperbolic, accordingly as their second order terms correspond to an elliptic, parabolic, or hyperbolic quadratic form. The behavior and theory of these different types of PDEs are strikingly different – representative examples is that the Laplacian is elliptic, the heat equation is parabolic, and the wave equation is hyperbolic.

Eccentricity classifications include:

Möbius transformations
Real Möbius transformations (elements of PSL2(R) or its 2-fold cover, SL2(R)) are classified as elliptic, parabolic, or hyperbolic accordingly as their half-trace is 0\leq |\operatorname{tr}|/2 < 1, |\operatorname{tr}|/2 = 1, or |\operatorname{tr}|/2 > 1, mirroring the classification by eccentricity.
Variance-to-mean ratio
The variance-to-mean ratio classifies several important families of discrete probability distributions: the constant distribution as circular (eccentricity 0), binomial distributions as elliptical, Poisson distributions as parabolic, and negative binomial distributions as hyperbolic. This is elaborated at cumulants of some discrete probability distributions.

Cartesian coordinates

In the Cartesian coordinate system, the graph of a quadratic equation in two variables is always a conic section – though it may be degenerate, and all conic sections arise in this way. The equation will be of the form

Ax^2 + Bxy + Cy^2 +Dx + Ey + F = 0\text{ with }A, B, C\text{ not all zero.} \,

As scaling all six constants yields the same locus of zeros, one can consider conics as points in the five-dimensional projective space \mathbf{P}^5.

Discriminant classification

The conic sections described by this equation can be classified with the discriminant[2]

B^2 - 4AC .\,

If the conic is non-degenerate, then:

Note that A and B are just polynomial coefficients, not the lengths of semi-major/minor axis as defined in the following sections.

Matrix notation

In matrix notation the equation above becomes:

\begin{bmatrix}x & y \end{bmatrix} . \begin{bmatrix}A & B/2\\B/2 & C\end{bmatrix} . \begin{bmatrix}x\\y\end{bmatrix} +Dx +Ey+F= 0.

or

\begin{bmatrix}x & y & 1\end{bmatrix} . \begin{bmatrix}A & B/2 & D/2\\B/2 & C & E/2\\D/2&E/2&F\end{bmatrix} . \begin{bmatrix}x\\y\\1\end{bmatrix} = 0.

and

B^2 - 4AC = -4 \left|\begin{matrix}A & B/2\\B/2 & C\end{matrix}\right| .

This is the usual way of representing a quadratic form by a symmetric matrix, while the factor of -4 is a constant due to the use of B^2-4AC rather than AC-(B/2)^2 in the quadratic formula.

As slice of quadratic form

The equation

Ax^2 + Bxy + Cy^2 +Dx + Ey + F = 0 \,

can be rearranged by taking the affine linear part to the other side, yielding

Ax^2 + Bxy + Cy^2 = -(Dx + Ey + F). \,

In this form, a conic section is realized exactly as the intersection of the graph of the quadratic form z=Ax^2 + Bxy + Cy^2 and the plane z = -(Dx + Ey + F). Ellipses and hyperbolae can be realized by a horizontal plane (D=E=0), while parabolae require that the plane be slanted. Degenerate conics correspond to degenerate intersections, such as taking slices such as z=-1 of a positive-definite form.

Standard form

Through change of coordinates these equations can be put in standard forms:

Such forms will be symmetrical about the x-axis and for the circle, ellipse and hyperbola symmetrical about the y-axis.
The rectangular hyperbola however is only symmetrical about the lines y = x and  y = -x. Therefore its inverse function is exactly the same as its original function.

These standard forms can be written as parametric equations,

Modified form

Three different types of conic sections. Focal-points corresponding to all conic sections are placed at the origin.

For some practical applications, it is important to re-arrange standard form so that focal-point can be placed at the origin. Then, mathematical formulation for a general conic section is given by,

Polar form:

r=\frac{l}{1-e\cos\theta}

Cartesian form:

\begin{alignat}{7}
\sqrt{x^{2}+y^{2}} = \left(l+e x\right) \\
\Rightarrow\left(\frac{x-\frac{le}{1-e^{2}}}{\frac{l}{1-e^{2}}}\right)^{2}+\frac{\left(1-e^{2}\right)y^{2}}{l^{2}} = 1 
\end{alignat}

From the above equation, linear eccentricity (c) is given by c=\left(\frac{le}{1-e^{2}}\right).

From the general equations given above, different conic sections can be represented as shown below:

Homogeneous coordinates

In homogeneous coordinates a conic section can be represented as:

A_1x^2 + A_2y^2 + A_3z^2 + 2B_1xy + 2B_2xz + 2B_3yz = 0.

Or in matrix notation

\begin{bmatrix}x & y & z\end{bmatrix} . \begin{bmatrix}A_1 & B_1 & B_2\\B_1 & A_2 & B_3\\B_2&B_3&A_3\end{bmatrix} . \begin{bmatrix}x\\y\\z\end{bmatrix} = 0.

The matrix M=\begin{bmatrix}A_1 & B_1 & B_2\\B_1 & A_2 & B_3\\B_2&B_3&A_3\end{bmatrix} is called the matrix of the conic section.

 \Delta = \det(M) = \det\left(\begin{bmatrix}A_1 & B_1 & B_2\\B_1 & A_2 & B_3\\B_2&B_3&A_3\end{bmatrix}\right) is called the determinant of the conic section. If Δ = 0 then the conic section is said to be degenerate, this means that the conic section is in fact a union of two straight lines. A conic section that intersects itself is always degenerate, however not all degenerate conic sections intersect themselves, if they do not they are straight lines.

For example, the conic section \begin{bmatrix}x & y & z\end{bmatrix} . \begin{bmatrix}1 & 0 & 0\\0 & -1 & 0\\0&0&0\end{bmatrix} . \begin{bmatrix}x\\y\\z\end{bmatrix} = 0 reduces to the union of two lines:

 \{ x^2 - y^2 = 0\} = \{(x+y)(x-y)=0\} = \{x+y=0\} \cup \{x-y=0\}.

Similarly, a conic section sometimes reduces to a (single) line:

\{x^2+2xy+y^2 = 0\} = \{(x+y)^2=0\}=\{x+y=0\} \cup \{x+y=0\} = \{x+y=0\}.

 \delta = \det\left(\begin{bmatrix}A_1 & B_1\\B_1 & A_2\end{bmatrix}\right) is called the discriminant of the conic section. If δ = 0 then the conic section is a parabola, if δ<0, it is an hyperbola and if δ>0, it is an ellipse. A conic section is a circle if δ>0 and A1 = A2 and B1 = 0, it is an rectangular hyperbola if δ<0 and A1 = -A2. It can be proven that in the complex projective plane CP2 two conic sections have four points in common (if one accounts for multiplicity), so there are never more than 4 intersection points and there is always 1 intersection point (possibilities: 4 distinct intersection points, 2 singular intersection points and 1 double intersection points, 2 double intersection points, 1 singular intersection point and 1 with multiplicity 3, 1 intersection point with multiplicity 4). If there exists at least one intersection point with multiplicity > 1, then the two conic sections are said to be tangent. If there is only one intersection point, which has multiplicity 4, the two conic sections are said to be osculating.[3]

Furthermore each straight line intersects each conic section twice. If the intersection point is double, the line is said to be tangent and it is called the tangent line. Because every straight line intersects a conic section twice, each conic section has two points at infinity (the intersection points with the line at infinity). If these points are real, the conic section must be a hyperbola, if they are imaginary conjugated, the conic section must be an ellipse, if the conic section has one double point at infinity it is a parabola. If the points at infinity are (1,i,0) and (1,-i,0), the conic section is a circle. If a conic section has one real and one imaginary point at infinity or it has two imaginary points that are not conjugated it is neither a parabola nor an ellipse nor a hyperbola.

Polar coordinates

In polar coordinates, a conic section with one focus at the origin and, if any, the other on the x-axis, is given by the equation

r=\frac{l}{{1\pm e\cos\theta}},

where e is the eccentricity and l is the semi-latus rectum (see below). As above, for e = 0, we have a circle, for 0 < e < 1 we obtain an ellipse, for e = 1 a parabola, and for e > 1 a hyperbola.

Development of the conic section as the eccentricity e increases

Applications

Conic sections are important in astronomy: the orbits of two massive objects that interact according to Newton's law of universal gravitation are conic sections if their common center of mass is considered to be at rest. If they are bound together, they will both trace out ellipses; if they are moving apart, they will both follow parabolas or hyperbolas. See two-body problem.

In projective geometry, the conic sections in the projective plane are equivalent to each other up to projective transformations.

For specific applications of each type of conic section, see the articles circle, ellipse, parabola, and hyperbola.

Intersecting two conics

The solutions to a two second degree equations system in two variables may be seen as the coordinates of the intersections of two generic conic sections. In particular two conics may possess none, two or four possibly coincident intersection points. The best method of locating these solutions exploits the homogeneous matrix representation of conic sections, i.e. a 3x3 symmetric matrix which depends on six parameters.

The procedure to locate the intersection points follows these steps:

See also

Notes

  1. "MathWorld: Cylindric section". http://mathworld.wolfram.com/CylindricSection.html. 
  2. Fanchi, John R. (2006), Math refresher for scientists and engineers, John Wiley and Sons, p. 44-45, ISBN 0-471-75715-2, http://books.google.com/books?id=75mAJPcAWT8C , Section 3.2, page 45
  3. Wilczynski, E. J. (1916), "Some remarks on the historical development and the future prospects of the differential geometry of plane curves", Bull. Amer. Math. Soc. 22: 317–329 .

References

External links